The Role of Genome Accessibility in Transcription Factor Binding in Bacteria
نویسندگان
چکیده
ChIP-seq enables genome-scale identification of regulatory regions that govern gene expression. However, the biological insights generated from ChIP-seq analysis have been limited to predictions of binding sites and cooperative interactions. Furthermore, ChIP-seq data often poorly correlate with in vitro measurements or predicted motifs, highlighting that binding affinity alone is insufficient to explain transcription factor (TF)-binding in vivo. One possibility is that binding sites are not equally accessible across the genome. A more comprehensive biophysical representation of TF-binding is required to improve our ability to understand, predict, and alter gene expression. Here, we show that genome accessibility is a key parameter that impacts TF-binding in bacteria. We developed a thermodynamic model that parameterizes ChIP-seq coverage in terms of genome accessibility and binding affinity. The role of genome accessibility is validated using a large-scale ChIP-seq dataset of the M. tuberculosis regulatory network. We find that accounting for genome accessibility led to a model that explains 63% of the ChIP-seq profile variance, while a model based in motif score alone explains only 35% of the variance. Moreover, our framework enables de novo ChIP-seq peak prediction and is useful for inferring TF-binding peaks in new experimental conditions by reducing the need for additional experiments. We observe that the genome is more accessible in intergenic regions, and that increased accessibility is positively correlated with gene expression and anti-correlated with distance to the origin of replication. Our biophysically motivated model provides a more comprehensive description of TF-binding in vivo from first principles towards a better representation of gene regulation in silico, with promising applications in systems biology.
منابع مشابه
Mapping of Transcription Factor Binding Region of Kappa Casein (CSN3) Gene in Iranian Bacterianus and Dromedaries Camels
κ-casein is a glycosilated protein in mammalian milk that plays an essential role in the milk micelles. Control of κ-casein expression reflects this essential role, although an understanding of the mechanisms involved lags behind that of the other milk protein genes. Transcriptional regulation, a first mechanism for controlling the development of organisms, is carried out by transcription facto...
متن کاملMapping of Transcription Factor Binding Region of Kappa Casein (CSN3) Gene in Iranian Bacterianus and Dromedaries Camels
κ-casein is a glycosilated protein in mammalian milk that plays an essential role in the milk micelles. Control of κ-casein expression reflects this essential role, although an understanding of the mechanisms involved lags behind that of the other milk protein genes. Transcriptional regulation, a first mechanism for controlling the development of organisms, is carried out by transcription facto...
متن کاملPost-translational changes of histones, methylation level, and ERβ protein level in the cumulus cell genome of infertile women with endometriosis
Background: Endometriosis (which affects up to 50% of infertile women) is one of the major causes impacting female infertility. Endometriosis, defined as the presence of endometrial glands and stroma outside the uterine tissue, causes a wide range of functional disorders in the process of follicular development and changes in the follicular milieu, resulting in the formation of an incompetent o...
متن کاملHomocysteine Induces Heme Oxygenase-1 Expression via Transcription Factor Nrf2 Activation in HepG2 Cells
Background: Elevated level of plasma homocysteine has been related to various diseases. Patients with hyperhomocysteinemia can develop hepatic steatosis and fibrosis. We hypothesized that oxidative stress induced by homocysteine might play an important role in pathogenesis of liver injury. Also, the cellular response designed to combat oxidative stress is primarily controlled by the transcripti...
متن کاملZelda is differentially required for chromatin accessibility, transcription factor binding, and gene expression in the early Drosophila embryo.
The transition from a specified germ cell to a population of pluripotent cells occurs rapidly following fertilization. During this developmental transition, the zygotic genome is largely transcriptionally quiescent and undergoes significant chromatin remodeling. In Drosophila, the DNA-binding protein Zelda (also known as Vielfaltig) is required for this transition and for transcriptional activa...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
- PLoS computational biology
دوره 12 4 شماره
صفحات -
تاریخ انتشار 2016